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Abstract 

Mean-payoff games (MPGs) are infinite duration two-player zero-sum games played on weighted 
graphs. Under the hypothesis of perfect information, they admit memoryless optimal strategies for both 
players and can be solved in NP n coNP. MPGs are suitable quantitative models for open reactive 
systems. However, in this context the assumption of perfect information is not always realistic. For 
the partial-observation case, the problem that asks if the first player has an observation-based winning 
strategy that enforces a given threshold on the mean-payoff, is undecidable. In this paper, we study the 
window mean-payoff objectives that were introduced recently as an alternative to the classical mean- 
payoff objectives. We show that, in sharp contrast to the classical mean-payoff objectives, some of the 
window mean-payoff objectives are decidable in games with partial-observation. 


1 Introduction 

Mean-payoff games (MPGs) |13| are infinite duration, two-player, zero-sum games played on weighted graphs, 
useful for modelling reactive systems with quantitative objectives and designing algorithms to synthesize 
controllers for such systems [6]. Like other verification games played on graphs, two players move a token 
around the graph for an infinite number of steps. One of the players selects a label, after which the second 
chooses an edge with this label. The token is then moved along the selected edge. This infinite interaction 
between the two players results in an infinite path in the graph. The objective of Player 1 is to maximize 
the limiting average payoff of the edges (defined by the weights that annotate them) traversed in this infinite 
path, while Player 2 tries to minimize this average. It has been shown in [31IT3] that both players in an MPG 
can play optimally using memoryless strategies, and as a consequence, those games are known to be solvable 
in NP n coNP, and although pseudo-polynomial-time algorithms to solve these games are known [511^. the 
question of whether they can be solved in polynomial-time is an important open question. 

In the version of MPG described above, the game is of perfect information: both players have complete 
knowledge of the history of the play up to the current position of the token. For many applications such 
as controller synthesis, it is often more natural to assume that players have only partial knowledge of the 
current state of the game. In practice, players may model processes with private variables that other players 
(processes) may not see, or controllers that acquire information about their environment using sensors with 
bounded precision, etc. Unfortunately, it has been shown in nni that MPGs games with partial-observation 
are undecidable. 

Window mean-payoff objectives (WMP objectives) were recently introduced in [9] as an alternative to 
the classical MP objectives. In a WMP objective instead of considering the long-run average along the 
whole play, payoffs are considered over a local bounded window sliding along the play. The objective is 
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then to make sure that the average payoff is at least zero over every window. The WMP objectives enjoy 
several nice properties. First, in contrast to classical MP games, we have a polynomial-time algorithm for 
determining WMP games. Second, they can be considered as approximation of the classical MP objectives 
in the following sense: (i) they are a strengthening of the MP objective, i.e. winning for the WMP objective 
implies winning for the MP objective, {ii) if a (finite memory) strategy is forcing a MP with value e > 0 
then that strategy is also enforcing the WMP objective for a window size that is bounded by a function of 
the game and strategy memory sizes. 

In this paper we consider the extension of WMP objectives to games with partial-observation. We show 
that, in sharp contrast with classical MP objectives, some of the WMP objectives are decidable for such 
games. As in [3], we consider several variants of the window MP objectives. For all objectives, we provide 
complete complexity results and optimal algorithms. More precisely, our main contributions are as follows: 

• First, we consider a definition in which the window size is fixed and the sliding window is started at 
the initial move of the game, this is called the direct window objective. For this definition we give an 
optimal EXP-time algorithm (Theorem [3]) in the form of a reduction to a safety game. Additionally, 
we show that this safety game has a nice structure that induces a natural partial order on game 
positions. In turn this partial order can be used to obtain a symbolic algorithm based on the antichain 
approach m- This shows that WMP objectives allow us not only to recover decidability but they also 
lead to games that have the potential to be solved efficiently in practice. The antichain approach has 
already been applied and implemented with success for LTL synthesis [4], omega-regular games with 
partial observation |2], and language inclusion between non-determinisitic Biichi automata nj. 

• Second, we consider two natural prefix-independent definitions for the window objectives, the (uni¬ 
form) fixed window objectives. We also give optimal EXP-time algorithms for these two definitions 
(Theorem [S] and Theorem [S]), when weights are polynomially bounded in the size of the game arena. 
For these objectives, we show that the sets of good abstract plays (sequences of observations) of the 
game with partial observation, for the two definitions, form regular languages whose complements 
can be recognized by non-deterministic Biichi automata of pseudo-polynomial size (Proposition!^ and 
Proposition 0). These automata can then be turned into deterministic parity automata that can be 
used as observers to transform the game of partial-observation into a game of perfect information with 
a parity objective. 

• Finally, we show that, when the size of the window is not fixed but rather left as a parameter, then for 
all the objectives that we consider the decision problems are undecidable (Theorem [5]) . 

2 Preliminaries 

Weighted game arenas. A weighted game arena with partial-observation (WGA, for brevity) is a tuple 
G = {QjqijH, A,w,Obs), where Q is a finite set of states, qj € Q is the initial state, E is a finite set of 
actions, ACQxSxQis the transition relation, w : A —)> Z is the weight function, and Obs € Partition (Q) 
is a set of observations containing {qi}. Let W = max{|u>(t)| : t G A}. We assume A is total, i.e. for every 
{q,<j) G Q X S there exists q' € Q such that {q,a,q') G A. If every element of Obs is a singleton, then we 
say G is a WGA with perfect information and if \Obs\ = 1 we say G is blind. For simplicity, we denote by 
post^(s) = {q' € Q I 3q € s : {q, a, q') G A} the set of cr-successors of a set of states s Q Q. 

In this work, unless explicitly stated otherwise, we depict states from a WGA as circles and transitions as 
arrows labelled by an action-weight pair: tr, a; G S x {—W,... ,W}. Observations are represented by dashed 
boxes and colors, where states with the same color correspond to the same observation. 

Abstract and concrete paths. A concrete path in an WGA is a sequence qoo'oqia'i ... where for alH > 0 
we have qi € Q, ai € H and {qi,ai,qi+i) G A. An abstract path is a sequence ooCToOiCTi ... where Oi G Obs, 
Ui £ Y, and snch that there is a concrete path qou^qiai... for which qi G Oi, for all i. Given an abstract 
path tp, let 'yi'ip) be the set of concrete paths that agree with the observation and action seqnence. Formally 
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7(V') = {<?o<7o9iO'i... I \/i > 0 : qi € Oi and {qi,a,qi+i) € A}. Also, given abstract (respectively concrete) 
path p = ooCTo ■ • ■ and integers k, I we define Tr[k..l] = Ok ■ ■ - Oi, Tr[..k] = 7r[0../c], and 7r[h.] = oiaioi+i .... 
Given a concrete path tt = qocroqiai ..., the payoff up to the (n + l)-th element is given by 

n —1 

w{TT[..n\) = ^ w{qi,ai,qi+i). 

i^O 

If TT is infinite, we define two mean-payoff values MP and MP as: 

MPItt) = liminf i-w(7r[..n]) MP(7r) = limsup Iw(7r[..n]) 

'n —^oo 71 n—¥oo ^ 


Plays and strategies. A play in a WGA G is an infinite abstract path starting at o/ € Obs where qj € oj. 
Denote by Plays(G) the set of all plays and by Prefs(G) the set of all finite prefixes of such plays ending in 
an observation. Let 7(Plays(G)) be the set of concrete paths of all plays in the game, and 7(Prefs(G)) be 
the set of all finite prefixes of all concrete paths. 

An observation-based strategy for Eve is a function from finite prefixes of plays to actions, i.e. A 3 : 
Prefs(G) —E. A play ip = ooaooiai... is consistent with A 3 if Ui = A 3 (V’[..*]) for all i. We say an 
observation-based strategy for Eve A 3 has memory p if there is a set M with \M\ = /r, an element mo € M, 
and functions ■ M x Obs —^ M and Uq '■ M x Obs —^ E such that for any play prefix p = OoCTq ... On we 
have A 3 (p) = ao(m„, o„), where m„ is defined inductively by m^+i = au{mi,Oi) for i > 0 . 


Objectives. An objective for a WGA G is a set Vq of plays, i.e. Vq C Plays(G). We say plays in Vq are 
winning for Eve. Conversely, all plays not in Vq are winning for Adam. We refer to a WGA with a fixed 
objective as a game. Having fixed a game, we say a strategy A is winning for a player if all plays consistent 
with A are winning for that player. Finally, we say that a player wins a game if (s)he has a winning strategy. 
We write V instead of Vg if G is clear from the context. 

Given WGA G and a threshold 1 ^ G Q, the mean-payoff (MP) objectives MPSupg.(z/) = {ip G Plays(G) | 
Vtt G 'y{ip) ■ MP( 7 r) > i/} and MPInf( 7 (z/) = {ip G Plays(G) | Vtt G 'y{ip) ■ MP IttI > i/} require the mean-payoff 
value be at least i/. We omit the subscript in the objective names when the WGA is clear from the context. 
Let 1 / = ^, w' he ‘A weight function mapping t G A to b-w{t) — a, for all such t, and G' be the WGA resulting 
from replacing w' in G for w. We note that Eve wins the MPSupq,( 0) (respectively, MPInfG/(0)) objective 
if and only if she wins MPSup( 3 (:/) (resp., MPInfG(^)). 


3 Window Mean-payoff Objectives 

In what follows we recall the definitions of the window mean-payoff (WMP) objectives introduced in [3] and 
adapt them to the partial-observation setting. For the classical MP objectives Eve is required to ensure the 
long-run average of all concretizations of the play is at least v. WMP objectives correspond to conditions 
which are sufficient for this to be the case. All of them use as a main ingredient the concept of concrete 
paths being “good”. We formalize this notion below. 

Given i > 0 and window size bound Imax G No = N \ {0}, let the set of concrete paths y with the good 
window property be 

GW(l/, i, G>ax) = {x I 3j < > ,y}. 

3 

As in |9], we assume that the value of /max is polynomially bounded by the size of the arena. 

For the first of the WMP objectives Eve is required to ensure that all suffixes of all concretizations of the 
play can be split into concrete paths of length at most /max and average weight at least v. The MP objectives 
are known to be prefix-independent, therefore a prefix-independent version of this first objective is a natural 
objective to consider as well. We study two such candidates. One which asks of Eve that there is some i 
such that all suffixes - after / - of all concretizations of the play can be split in the same way as before. This 
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DirFix C UFix C Fix 

^ ^ ^ 

UDirBnd C DirBnd C UBnd C Bnd C MPInf (c MPSup) 

Figure 1: Implications among the objectives 



Figure 2: Blind WGA where, for any /max € No, 
the only possible abstract play is in Fix(/max) but 
not in UFix(/max)- 


Figure 3: Perfect information WGA where Eve 
wins both MP objectives but none of the FWMP 
or BWMP objectives. 


is quite restrictive since the i is uniform for all concretizations of the play. The second prefix-independent 
version of the objective we consider allows for non-uniformity. 

Formally, the fixed window mean-payoff (FWMP) objectives for a given WGA and threshold G Q are 
defined below. For convenience we denote by ip plays from Plays(G) and concrete plays by tt, i.e. elements 
of 7(Plays(G)). 


DirFix(j^, /max) = {tp I Vtt G 'y{ip),Vi > 0 : tt G GW( f ^max)} 

UFix(l^, /max) = {p’ I > OjVtT G l{lp),'ij > / : TT G GW(l/, j,/max)} 

Fix(l^,/max) = I Vtt G 7(^)1 3/ > 0,Vj > / : tt G GW(l/, j,/max)} 

For the FWMP objectives, we consider /max to be a value that is given as input. Another natural question 
that arises is whether we can remove this input and consider an even weaker objective in which one asks 
if there exists an /max- This is captured in the definition of the bounded window mean-payoff (BWMP) 
objectives which are defined for a given threshold v € Q. 

UDirBnd(i/) = {ip \ 3/max G No,V7r G "f{ip),'ii > 0 : tt G GW(i/,/, /max)} 

DirBnd(i/) = {V- | Vtt G 7 (^/’), 3/max G No,V/ > 0 : tt G GW(i/,/, /max)} 

UBnd(:/) = [ip I 3/max G No,3/ > 0,V7r G ^{ip),'ij > z : tt G GW(zz, j,/max)} 

Bnd(i/) = {ip I Vtt G 7('0), 3/max G Nq, 3z > 0, Vj > z : tt G GW(zz, j, /max)} 

As with the mean-payoff objectives we can assume, without loss of generality, that v = 0. Henceforth, 
we omit v. 

3.1 Relations among objectives 

Figure [T] gives an overview of the relative strengths of each of the objectives and how they relate to the 
mean-payoff objective. The strictness, in general, of most inclusions was established in [9], and Figure [2] 
provides an example for the remaining case between Fix and UFix. 

In general the mean-payoff objective is not sufficient for the FWMP or BWMP objectives, e.g. see 
Figure [3l Our first result shows that if, however. Eve has a finite memory winning strategy for a strictly 
positive threshold, then this strategy is also winning for any of the FWMP or BWMP objectives. A specific 
subcase of this was first observed in Lemma 2 of [9] . 
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Figure 4: Gadget which forces Eve to play in¬ 
finitely many 


su{#},o s,o 



Figure 5: Gadget which, given that Eve will play 
^ infinitely often, forces her to play ^ in intervals 
of bounded length. 


Theorem 1. Given WGA G, if Eve has a finite memory winning strategy for the MPInf[£) (or MPSup{£) ) ob¬ 
jective, for £ > 0, then the same strategy is winning for her in the DirFix(fj,)G game - where p is bounded by 
the memory used by the strategy. 

3.2 Lower bounds 

In [9] it was shown that in multiple dimensions, with arbitrary window size, solving games with the (direct) 
fixed window objective was complete for EXP-time. We now show that in our more general setting this 
hardness result holds, even when the window size is a fixed constant and the weight function is given in 
unary. 

Lemma 1. Let Imax S No be a fixed constant. Given WGA G, determining if Eve has a winning strategy 
for the DirFix(lmax), UFix(lmax) or the Fix{lmax) objectives is F'AP-hard, even for unary weights. 

Proof. We give a reduction from the problem of determining the winner of a safety game with imperfect 
information, shown in [7| to be EXP-complete. 

A safety game with imperfect information is played on a non-weighted game arena with partial-observation 
G = {Q, qi, E, A, Ohs). A play of G is winning for Eve if and only if it never visits the unsafe state set lA Q. 
Without loss of generality, we assume unsafe states are trapping, that is (u, a,q) F A and u Fti imply that 
u = q. 

Let w be the transition weight function mapping (u,a,q) F A to —\\i u F Li and all other t G A to 0. 
Denote by the resulting WGA from adding w to G. It should be clear that Eve wins the safety game G 
if and only if she wins MPInfG„(0), DirFixG„(^max), UFixG„ (^max), and FixG„(Giax) - for any /^ax- That is, 
all objectives are equivalent for Gy^. □ □ 

In [9] the authors show that determining if Eve has a winning strategy in the fc-dimensional version of 
the UDirBnd and UBnd objectives with perfect information is non-primitive recursive hard. We show that, 
in our setting, these decision problems are undecidable. 

Theorem 2. Given WGA G, determining if Eve has a winning strategy for any of the BWMP objectives is 
undecidable, even if G is blind. 


Sketch. Our proof is by a reduction from the universality problem of weighted finite automata which is 
undecidable (Tj. A weighted finite automaton is a tuple Af = {Q,Yi,qi, A,w,F) where E C Q is a set of 
final accepting states. A (accepting) run of the automaton on a word x = a’oo'i .. • cr„ G E* is a sequence 
= 9091 • ■ • 9n F Q'^ such that qn F E and (qi, ai, qi+i) G A for all 0 < i < n. The cost of the run r is 
‘io(r) = w{qi,ai,qi+i). If the automaton is non-deterministic, it may have several runs on x. In that 

case, the cost of x in Af (denoted by Af(x)) is defined as the minimum of the costs of all its accepting runs 
on X. 

The universality problem for weighted automata is to decide whether, for a given automaton Af, the 
following holds: Vx G E* : Af(x) < 0. If so, Af is universal. Our reduction constructs a blind WGA, Gjy, so 
that: 
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Figure 6: Blind gadget to simulate the weighted automaton J\f. 


• if A/” is universal, then Eve has a winning strategy for the objective UDirBnd, 

• if A/” is not universal, then Adam has a winning strategy for the complement of the objective Bnd. 

As shown in Fig. [I] UDirBnd C Bnd and all the other BWMP objectives lie in between those two. So, our 
reduction establishes the undecidability of all BWMP objectives at once. 

Our reduction follows the gadgets given in Figs. HHbl When the game starts, Adam chooses to play from 
one of the three gadgets. As the game is blind for Eve, she does not know what is the choice of Adam and 
so she must be prepared for all possibilities. Note also that as Eve is blind, her strategy can be formalized 
by an infinite word ic € E U {#}“. The first two gadgets force Eve to play a word w such that: (Ci) 
there are infinitely many ^ in re, and {C 2 ) there exists a bound b G N such that the distance between two 
consecutive in w is bounded by b. Note that although (Ci) holds with under a mean-payoff objective, the 
Bnd objective is necessary to ensure ( 02 )- 

Now, we assume that Eve plays a word w = #rci#iC 2 #W 3 #... #Wn# ■ ■ ■ that respects conditions Ci 
and ( 72 , and we consider what happens when Adam plays in the third gadget (Fig. [S]). The best strategy 
for Adam is to simulate the accepting runs of A/". If A/” is not universal then Eve chooses rci G E* such that 
M(wi) > 0 , and then she plays the strategy and she wins the objective UDirBnd. Indeed Eve plays 

a finite memory strategy that forces a mean-payoff which is at least ^ > 0 as each new ^ brings - 1 -^ and 
we know that J\f{wi) > 0 . The result then follows from Theorem [TJ If Af is universal, no matter which word 
w is played by Eve, the mean-payoff of the outcome of the game will be at most as any word wi G S* 
played between two consecutive ^ is such that Af{wi) < — 1. Thus, Adam wins for the complement of the 
mean-payoff objective. From the relation between the mean-payoff objective and Bnd (see Fig. [T]) it follows 
that Adam wins for the complement of the objective Bnd. □ □ 

4 Solving DirFix games 

In this section we establish an upper bound to match our lower bound of Section 13.21 for determining the 
winner of DirFix games. We first observe that for WGAs with perfect information the DirFix(Zi„ax) objective 
has the flavor of a safety objective. Intuitively, a play tt is winning for Eve if every suffix of tt has a prefix 
of length at most /max with average weight of at least 0. As soon as the play reaches a point for which this 
does not hold, Eve loses the play. In WGAs with partial-observation we need to make sure the former holds 
for all concretizations of an abstract play. 

We construct a non-weighted game arena with perfect information G' from G. Eve’s objective in G' will 
consist in ensuring the play never reaches locations in which there is an open window of length /max, for 
some state. This corresponds to a safety objective. Whether Eve wins the new game can be determined in 
time linear w.r.t. the size of the new game (see, e.g. m)- The game will be played on a set of functions J" 
which is described in detail below. We then show how to transfer winning strategies of Eve from G' to G 









and vice versa in Lemmas [S] and HI Hence, this yields an algorithm to determine if Eve wins the DirFix(Z„iax) 
objective which runs in exponential time. 

Theorem 3. Given WGA G, determining if Eve has a winning strategy for the DirFix^l^ax) objective is 
EXP -complete. 


Let us define the functions which will be used as the state space of the game. Intuitively, we keep track 
of the belief of Eve as well as the windows with the minimal weight open at every state of the belieflll 

For the rest of this section let us fix a WGA with partial-observation G and a window size bound 
^max S No. We begin by defining the set of functions T as the set of all functions f : Q ({1,..., Imax} —t 
{—W ■ Imax, ■ ■ ■ ,0}) U {-L}. Denote by supp(/) the support of /, that is the set of states q € Q such that 
f{q) ^ _L. For q G supp(/), we denote by f{q)i the value f{q){i). The function fj G T is such that 
= 0) for all 1 < / < Imax, and fi{q) = ± for all g G Q \ {qi}- Given f\GT and cr G E, we say fiGT 
is a cr-successor of fi if 

• supp(/2) = post^(supp(/i)) n o for some o G Obs] 

• for all q G supp(/ 2 ) and all 1 < j < /max we have that f2{q)j maps to max{—W • /max, minjO, C(g)}}, 
where f{q) is defined as follows 


C(g) 


min fiip)j-i + w{p, a, q) if j > 2 

pesupp(/i), 

(p,(T,g)eA. 

< /i(p)j-i<0 

min w{p,a,q) otherwise. 

pesupp(/i), 

(p.o-.9)eA 


Lemma 2. The number of elements in T is not greater than _ 

We extend the supp operator to finite sequences of functions and actions. In other words, given p' = 
/o^o/iCTi G {J- ■ E)*, supp(p') !->• soCToSicn ... where Si = supp(/i) for all i > 0. In an abuse of notation, we 
define the function supp^^ : (O&s-E)* xiF —(J^-E)* which maps abstract paths to function-action sequences. 
Formally, given p — ooctoOicti • • • G Prefs(G) and p G T with supp(i^) C oq, supp“^(p, (^) /octo/icti ... 
where fo = ip and for all i > 0 we have that /i+i is the CTi-successor of fi such that supp(/i+i) C Oi+i. Both 
supp and supp^^ are extended to infinite sequences in the obvious manner. 

The following two results enunciate the key properties of sequences of the form {E ■ E)*. Intuitively, the 
set of all those sequences corresponds to the windowed, weighted unfolding of G with information about 
reachable states as well as open windows. 

Lemma 3. Let p = oqUo ... o„ be an abstract path, p G E such that supp{p) C oq and supp^^{p,p) = 

fo^o ■ ■ ■ fn G (-A • E)*. A state q G Q is reachable from some state go G supp{p) through a concrete path 

goCTo ...qnF l{p) if and only if q G supp{fn). 

Gonsider an abstract path ip and a positive integer n. We say a window of length / is open at g G "/(ipln]) 
if there is some concretization y of ip[..n] with g = x[n] such that y ^ GW(n — 1,1). 

Lemma 4. Let p = oqito ■ • ■ On be an abstract path, p G E such that supp{p) C oq and supp~^{p,p) = 

/qito ■ ■ ■ fn F {E ■ E)*. Given state p G supp{fn) and 1 < / < Imax such that I < n, then there is a window of 

length I open at p if and only if fnip)i < 0. 

Formally, the arena G' = {E, fi, E, A'). The transition relation A' contains the transition (/i, cr, / 2 ) if /2 
is the cr-successor of /i. Eve, in G', is required to avoid states U = {f G E \ 3g G supp(/) : f{q)i^„^„ < 0}. 

'^The terms belief and knowledge are used to denote a state from any variation of the classic “Reif construction” 1181 to 
turn a game with partial-observation into a game with perfect information. Other names for similar constructions include 
“knowledge-based subset construction” (see e.g. [TO]). 
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Lemma 5. If Eve wins the safety objective in G', then she also wins the DirFix(lmax) objective in G. 

Proof. Assume A' is a winning strategy for Eve in G'. We define a strategy A for her in G as follows: 

A(p) 1-^ A'(supp^i(p,//)) 

for all p € Prefs(G). We claim that A is winning for her in G. Towards a contradiction, assume tf € Plays(G) 
is consistent with A and that ip ^ DirFix(Zr„ax)- Recall that this implies there are n G N, q G Q such that 
there is a window of length Imax open at g S 'y{ip[n]). By Lemma[4]we then get that /„ from supp“^('(/;, //) = 
/oO'o/iO’i ■ • ■ is in G. As supp~^{ip, fj) is consistent with A', this contradicts the assumption that A' was 
winning. □ □ 

Lemma 6. If Eve wins the DirFix(lmax) objective in G, then she also wins the safety objective in G'. 

Proof. Assume A is a winning strategy for Eve in G. We define a strategy A' for her in G' as follows: 

A'(p') !->■ A o obs o supp(p') 

for all p' G Prefs(G'). We claim that A' is winning for her in G'. Again, towards a contradiction, assume 
Ip' G Plays(G') is consistent with A' and that ip' visits some f GU. This implies, by Lemma ID that there 
is a window of length l^ax open at some q G supp(/) in ^ = obs(supp('!/)')). As A is total, for any a G S 
Eve plays then there is valid cr-successor of q that Adam can choose as the next state. Hence there is some 
X G Plays(G) consistent with A such that x ip have the same prefix up where q G 7(x[*q]), and there 
is a concretization tt of y such that 7 r[iq] = q. As y is consistent with A and y ^ DirFix(lmax)) this contradicts 
the fact that it was a winning strategy. □ □ 

4.1 A symbolic algorithm for DirFix games 

We note that state space of the construction G' presented in Section 0] admits an order such that if a state is 
smaller than another state, according to said order, and Eve has a strategy to win from the latter, then she 
has a strategy to win from the former. In this section we formalize this notion by defining the order and, in 
line with mis], propose an antichain-hased algorithm to solve the safety game on G'. 

We define the uncontrollable predecessors operator UPre : R(J^) —?► Pi^) as follows: 

UPre(S') = {p' GT I Vcr e E, 3g' G S' : (p', a, q') G A'}. 

For S G V{E), we denote by pA.(SUUPre(A)), the least fixpoint of the function F : A —>■ SUUPre(A) in the 
p-calculus notation (see [14]). Note that F is defined on the powerset lattice, which is finite. The following 
is a well-known result about the relationship between safety games and the UPre operator (see e.g. |15)1. 

Proposition 1. Eve wins a safety game with unsafe state set lA if and only if the initial state of the game 
is not contained in p,X.{U U UPre(X)). 

Definition 1 (The partial order). Given f',g' G J- we say f ^ g' if and only if supp{f') C supp{g') and 

Vg G supp{f'),'\fi G {1 , . . . , Imax }) dj G {i, . . . , Imax} ■ f {q)i ^ 9 { 9)3 ■ 

An antichain is a non-empty set S G 'P(A) such that for all x,y G S we have x :^y. We denote by 21 

the set of all antichains. Given a, 6 G 21, denote hy a G b the fact that Vx G b,3y G a : y < x. For S G V{iF) 

we denote by [SJ the set of minimal elements of S, that is [SJ ={a:GS | \/y G S : y ^ x implies y = x}. 
Clearly [SJ is an antichain. 

Given S G P{X) we denote by Sf the upward-closure of S, that is Sf = {t G T \ S ^ t}. We say a 
set s G V{iF) is upward-closed if S = Sf- Note that [SJf = Sf and therefore, if S is upward-closed, the 
antichain [SJ is a succinct representation of S. 


Lemma 7. The following assertions hold. 



Figure 7: For n > Z^ax + 1 the abstract path (oq ... On)‘^ is winning for the Fix condition but infinitely often 
visits an unsafe state in the construction from Section |4l 


1. lA is upward-closed. 

2. If S,T G ViJ-) are two upward-closed sets, then SUT is also upward-closed. 

The usual way of showing an antichain algorithm works dictates that we now prove the UPre operator, 
when applied to upward-closed sets, outputs an upward-closed set as well. Unfortunately, this is not true in 
our case. The following example illustrates this difficulty. 

Example 1. Consider the WGA from Figure\^ and let Imax = 2. We note that the function f such that 
/(go) = -L o-nd /(gi)i = 1, /(gi )2 = 0 is in UPre{U). We also have that for the function g such that g{qo) = -L 
and g{qi)i = 0,g(gi)2 = 1 we get that f F g. It is easy to verify that g ^ UPrelfA). Hence, UPre{I() is not 
upward-closed. 

However, we claim that one can circumvent this issue by ignoring elements from U. Thus we are able to 
prove that, under some conditions, UPre does preserve “upward-closedness”. 

Lemma 8. Given upward-closed set S G V{H) and f,gG IF\U, if f G UPre{S) and f F g, then g G UPre{S). 

We define a version of the uncontrollable predecessors’ operator which manipulates antichains instead of 
subsets of F. 


LUPreJ(a) = [{p' GF\U \ \la G G a,3r' G F ■. {p',a,r') GA'Aq'F r'}\ 

Given a, 6 G 21 we denote by aUb the least upper bound of a and b, i.e. aUb = [{g' G F \ q' G a or q' G 6 }J. 
It is easy to check that (a U 6 )t = of U 6 / for any a, 6 € 21. 

Theorem 4. Given WGA G, Eve wins the DirFix{lmax) objective if and only if {q'j} 3 pi-X .{\lA\U'iUPre\{X)). 

5 Solving Fix games 

Since Fix games are a prefix-independent version of DirFix games, it seems logical to consider an analogue 
of the perfect information game from the previous section with a prefix-independent condition. Indeed, the 
reader might be tempted to extend the approach used to solve DirFix games by replacing the safety objective 
with a co-Biichi objective in order to solve UFix or Fix games. However, we observe that although Eve 
winning in the resulting game is sufficient for her to win the original Fix game, it is not necessary. Indeed, an 
abstract play visits states from lA infinitely often if and only if for infinitely many i there is a concretization 
of the play prefix up to i which violates G\N{i,l^ax)- Nevertheless, this does not imply there exists one 
(infinite) concretization of the play which violates GW(*, Zmax) for infinitely many i. Figure [7] illustrates this 
phenomenon. 

For the reasons stated above, we propose to solve Fix games in a different way. We first introduce the 
notion of observer. Let be a deterministic parity automaton!! We say A is an observer for the objective 
V if the language of A is V, i.e. C{A) = V. In [3, the authors show that the synchronized product of G 

^We refer the reader who is not familiar with parity automata or games to m 
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and an observer for y is a parity game of perfect information which is won by Eve if and only if she wins G. 
Thus, it suffices to find an algorithm to construct an observer for Fix(Zmax) to be able to solve Fix games. 

For convenience, we start by describing a non-deterministic machine that accepts as its language the 
complement of Fix(Zniax)- Note that all elements of Fix(/niax) start with the observation {g/} so it suffices to 
describe the machine that accepts any word w € (Y, ■ Obs)‘^ such that {qj} ■ w € Plays(G) \ Fix(lniax)- The 
machine we construct is similar to the one used in to make objectives of imperfect information games 
visible. Intuitively, at each step of the game and after Adam has revealed the next observation we will guess 
his actual choice of state using non-determinism. Additionally, we shall guess whether or not a violating 
window starts at the next step. The state space of the automaton will therefore consist of a single state 
from Q, a negative integer to record the weight of the tracked window, and the length of the current open 
window. 

Formally, let A/" be the automaton consisting of the state space F = Qx{l,, Zmax}x{—IT-Zniax,. .., — 1}U 
{_L}; initial state {qi, 1, T); input alphabet E' = E x Obs; and A" C F x E' x F. The transition relation A" 
has a transition ((p, i, n), (cr, o), (g, j, m)) if (p, a, q) G A, q G o, 

{ w{p, a, q) if w{p, a,q) <0 

n + w{p, a,q) if n ^ ± An + w{p, a,q) <0 Ai < /max 
T otherwise, 


and 



if m = n + w{p, a, q) 
otherwise. 


We say a state {q,i,n) G F is accepting if * = /max and n T. The automaton accepts a word x if and 
only it has a run (go,* 0 )n-o) (ctq, oi) (gi,F,ni) (cti, 02 )... on a; such that for infinitely many j we have that 
(qj, ij , rij ) is accepting. 


Proposition 2. The non-deterministic Biichi automaton Af accepts a word '0 G Plays(G) if and only if 
^0 ^ Pixi^lmax) ■ 


At this point we determinize Af and complement it to get a deterministic automaton with state space of 
size exponential in the size of Af yet has parity index that is polynomial w.r.t. the size of Q (see nzmiiini). 
The synchronized product of G and the observer yields a parity game with the same size bounds. The desired 
result follows from the parity games’ algorithm and results of Un- 


Theorem 5. Given WGA G, determining if Eve has a winning strategy for the Fix(lmax) objective can be 
decided in time exponential in W and the size of G. 


Corollary 1. Given WGA G with unary encoded weights, deciding if Eve has a winning strategy for the 
Fix(lmax) objective is EXP -complete. 


6 Solving UFix games 

In order to determine the winner of UFix games, we proceed as in the previous section by finding a non- 
deterministic Biichi automaton that recognizes the set of bad abstract plays. However, in this case the 
situation is more complicated because a bad abstract play might arise from a violation in the uniformity, 
rather than because of a concrete path with infinitely many window violations. Figure [5] illustrates this 
issue. To overcome this, we first provide an alternative characterization of the bad abstract plays for Eve. 
Consider some if G Plays(G). We say tt G "fiip) merges with infinitely many violating paths if for all f > 0, 
there are j > i, k > j -P /max and some y G 'y{'tlj[..k]) such that 7r[fc] = x[k] and y ^ GW(j, /max). We refer to 
j as the position of the violation and to k as the position of the merge. Our next result formally states the 
relationship between concrete plays merging for multiple violations and UFix games. 
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Lemma 9. Given WGA G and ip G Plays{G), there is ir € "fi'ip) merging with infinitely many violating paths 
if and only if tp ^ UFix{lmax)- 

Proof. (=^>) Assume there is a tt G 7 (V') merging with infinitely many violating paths. We have that there are 
two infinite sequences of indices J = {jo, ji,... } and K = (fco, fci, ■ • ■ } such that ji < j;+i and j; + Imax < h , 
for all I > 0, and for which we know that there is concrete path xi G 'y{'ip[..ki]) such that + ^max] 

realizes an open window of length /max and xii^i] = for all I > 0. Observe that for all / > 0 we have 

that xi ■ '^[ki-.] G xiip) and that xi ' '^[ki--] ^ GW(j;, /max)- In other words, 

V/ > 0, 3a G 7 (^), 3m > I : a ^ GW(m, /max) 
which implies that ip ^ UFix(/max)- 

{<=) Assume 'ip ^ UFix(/max)- We have that there is a infinite sequence of indices J = (jojji;---} 
such that jk < jfc+i, for all fc > 0, and for which we know there is a concrete play tt^ G 7(V') such that 
TTk ^ GW(jfc — /max, ^max), for all k > 0. Observe that for all / > 0 the set 7 (V'[*]) is finite and bounded by 
IQI- Thus, by Pigeonhole Principle we have that, for all n > 0 there is ? 7 „ G (tt^ | 1 < w < |(5| • n} C xi'f’) 
which merges with at least n violating paths. Consider an arbitrary rji. If rji merges with infinitely many 
violating paths then we are done and the claim holds. Otherwise it only merges with a finite number of 
violating paths, say oi. From the previous argument we know there is an rja' G 7(V') that merges with at 
least a'l = oi + 1. Clearly rji and rja' are disjoint at every point after a{, lest rji would merge with a new 
violating path. We inductively repeat the process, if rja'. merges with infinitely many violating paths then we 
are done. Otherwise it only merges with some finite number of violating paths, say Oi. In that case we turn 
our attention to ■ Note that since Q is finite this process can only be done a finite number of times. 
Indeed, after having discarded at most \Q\ — 1 concrete plays (which are disjoint after some finite point) 
it must be the case the last remaining possible concrete play has the desired property or we would have a 
contradiction with our assumptions. Thus, there is some concrete play tt G "/{'ip) that merges with infinitely 
many violating paths. □ □ 

We now construct the non-deterministic Biichi automaton A/”' that recognizes plays that contain a concrete 
path that merges with infinitely many violating paths. The idea is that we non-deterministically keep track 
of two paths: one that will eventually witness a violation and then merge with the other, which ultimately 
serves as the witness for the path that merges with infinitely many violating paths. When the two paths 
merge, the automaton non-deterministically selects a new path to witness the violation. This is achieved by 
non-deterministically selecting any state in the belief set of Eve as these states represent the end states of 
any concrete play consistent with the abstract play so far. To avoid the double exponential associated with 
taking the Reif construction before determinizing the automaton, we instead compute the belief set on-the-fly 
using a Moore machine that feeds into our non-deterministic automaton. By transferring the exponential 
state increase to an exponential increase in the alphabet size, the overall size of the determinized automaton 
(after composition with the Moore machine) will be at most singly exponential in the size of our game and 
W. 

More specifically, denote by B the machine that, given ip = ooaooiai ■ • • G Plays(G') as its input yields 
the infinite sequence ooCToSoOiCTiSi • • • G {Obs • S • V{Q))‘^ such that sq = {qi} and for all / > 0 we have 
Si+i = post^. (si). One can easily give a definition of - which closely resembles a subset construction - 
with a state space at most exponential w.r.t. G. Observe that the machine realizes a continuous function, 
in the sense that every prefix of length i of the input uniquely defines the next annotation. Thus, the 
annotation can be done on-the-fly. 

Formally, Af' consists of the state space F' = Q x Q x {1,, /max} x {—W■ /max, ■ ■., —1} U {T, T}; initial 
state (g/, g/, 1, T); input alphabet E" = S x Obs x V{Q)] and A'" C F x E" x F. The transition relation 
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A'" has a transition {{p,p',i,n),(u,o,s),{q,q',j,m)) if (jj',a,q') € A, q G s, q' € o, 
w{p,a,q) a {p,a,q) € AAw{p,a,q) <0 

n + w{p, (T,q) if {p, a, q) e A A n ^ ± A n + w{p, a,q) <0 Ai < Zmax 
T if {p,a,q) e A A{n^T V p^p') A ±Ai = Zmax 

± otherwise, 

and 

{ ^max if ?Ti = T 

i + 1 if m = n + w{p, a, q) 

1 otherwise. 

We say a state {q, q', i, n) g F' is accepting if q = q', n = T. The automaton accepts a word x if and only it 
has a run {qo, q'^, io, uq) (ctq, oi, si) {qi, q'i,ii, ni) (cti, 02 , S 2 ) ■ • ■ on cc such that for infinitely many j we have 
that {qj,q'pij,nj) is accepting. 

Proposition 3. The non-deterministic Biichi automaton M' accepts a word a = B{ip), where ij) G Plays{G), 
if and only if ip ^ UFix{lmax)- 

We recall that determinizing Af and complementing it yields an exponentially bigger deterministic au¬ 
tomaton. Its composition with B, itself exponentially bigger, accepts the desired set of plays and is still 
singly exponential in the size of the original arena and W. Once more, the desired result follows from the 
algorithm presented in m- 

Theorem 6. Given WGA G, determining if Eve has a winning strategy for the UFix(lmax) objective can be 
decided in time exponential in W and the size of G. 

Corollary 2. Given WGA G with unary encoded weights, deciding if Eve has a winning strategy for the 
UFix{lmax) objective is EXP-complete. 
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A Proof of Theorem [T] 


Proof. In m the authors show that if Eve is only allowed to play finite memory strategies then she wins 
the MPInf(z/) game if and only if she wins the MPSup(z/) game, for any i/ € Q. We show the claim holds 
for MPInf(e). Let A 3 = (M, mo, Ou, Oo) be the deterministic Moore machine representation of Eve’s finite 
memory winning strategy. Consider the product of the arena with Eve’s finite memory winning strategy, 
G X M, constructed in the obvious manner, i.e. every path in G x M corresponds to a concrete path 
consistent with her strategy. Clearly all cycles in G x M have weight of at least e, otherwise Adam can 
create a concrete path with mean-payoff value less than £ by “pumping” the cycles with value less than e. 
As any path in G x M corresponds to concrete plays consistent with Eve’s strategy, this contradicts the fact 
that the strategy is winning for her. By Pigeonhole Principle we have that for any path in G x M: if a 
window opens at step i, then after i there is a sequence of length at most |M||Q| — 1 that is not involved in 
any cycle. Now, since every cycle has weight e > 0, after at most 


W-\M\\Q\ 


■\M\\Q\ 


steps the window will have closed. It follows that for all ijj G Plays(G) consistent with her strategy: 


Vtt G 'y{tp),yi > 0 : TT G GW(*, p) 


which concludes our argument. 


□ 


□ 


B Proof of Theorem [2] 

Proof. We provide a reduction from the universality of weighted finite automata which is uiidecidable [T] . A 
weighted finite automaton is a tuple M = {Q, E, qj, A, w, F) where E C Q is a set of final accepting states. 
A (accepting) run of the automaton on a word x = ctocti ... cr„ G E* is a sequence r = q^qi .. .q-n G such 
that qn € F and Ui, g^+i) G A for all 0 < f < n. The cost of the run r is w{r) = If 

the automaton is non-determiiiistic, it may have several runs on x. In that case, the cost of x in W (denoted 
by Wix)) is defined as the minimum of the costs of all its accepting runs on x. 

The universality problem for weighted automata is to decide whether, for a given automaton Af, the 
following holds: 

Va: G E* : Afix) < 0. 

We construct a blind WGA, Gjq-, so that: 

• if A/” is universal, then Eve has a winning strategy for the objective UDirBnd, 

• if A/” is not universal, then Adam has a winning strategy for the complement of the objective Bnd. 

As shown in Fig. [TJ UDirBnd C Bnd and all the other BWMP objectives lie in between those two. So, our 
reduction establishes the undecidability of all BWMP objectives at once. 

Our reduction follows the gadgets given in Fig. |31[S][ni When the game starts, Adam chooses to play from 
one of the three gadgets. As the game is blind for Eve, she does not know what is the choice of Adam and 
so she must be prepared for all possibilities. Note also that as Eve is blind, her strategy can be formalized 
by an infinite word w G E U Let us show first that the two first gadgets force Eve to play a word w 

such that: 

(Gi) there are infinitely many in w, and 

(G 2 ) there exists a bound 6 G N such that the distance between two consecutive ff in w is bounded by b. 
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Assume that Eve plays a word w = #rei#W 2 #W 3 # ... ■ ■ ■ that respects conditions Ci and C 2 , 

with each Wi G E*. First, if Adam decides to play in the first gadget (Fig. [4]), then either Adam stays in state 
<71 forever, and he does not open any window, or he decides at some point to go from qi to ( 72 , whereupon he 
does open a window. However, after at most b steps Adam has to leave (72 for (73 at the next occurence of the 
^ symbol, the bound b is guaranteed by C 2 - After at most b additional steps, the open window will be closed 
as the self loop on (73 is labeled with the weight +1. So in this case. Eve wins the objective UDirBnd. Second, 
if Adam decides to play in the second gadget (Fig. [S]), then he can go from 54 to (75 on the # symbols. The 
windows that open on those transitions will all close within b steps according to condition C 2 and the game 
moves back to 54 . So again, Eve wins for the objective UDirBnd. 

Now assume that Eve plays a word w that violates either condition Ci or condition C 2 - First, if w violates 
Cl, then Adam chooses the first gadget (Fig.0]), and just after Eve has played her last Adam moves from 
<71 to 52 - As there will be no ^ anymore, Adam can loop on 52 and the window that he has opened will 
never close. Hence, Adam wins for the complement of the objective Bnd. Second, if w violates C 2 then there 
exists an infinite sequence of indices H < *2 < ■ • • < < ■ • ■ such that \ < \wi^ | < • • • < \'Wi^ \ < .... 

Then Adam can read this sequence of sub-words using runs of the form (74 ( 55 ) *( 74 . Each such run will open 
a window that closes at the end of the sub-word. But as the sequence of lengths of the sub-words is strictly 
increasing and infinite, Adam wins for the complement of the objective Bnd. 

Now, we will assume that Eve plays a word w = ■ ■ ■ that respects conditions 

Cl and C 2 , and we consider what happens when Adam plays in the third gadget (Fig. [S]). As we will see, in 
this gadget, the best strategy for Adam is to simulate the accepting runs of JV. 

Assume first that automaton Af is non-universal. Then by definition, there exists a finite word wi G E* 
such that all accepting runs of A/” on rci have a non-negative value, i.e. A/'(ici) > 0. In that case, w = {^wi)‘^ 
is a finite memory winning strategy for Eve for the objective Bnd. Indeed, if Adam simulates accepting runs 
on w then the mean-payoff of the outcome is at least > q as each new ^ brings and we know 
that Af{wi) > 0. So Eve wins for the objective UDirBnd by Theorem [U as Eve obtains a strictly positive 
mean-payoff bounded away from zero with a finite memory strategy. Now, if Adam chooses not to follow 
accepting runs then the game ends up in state T from which the mean-payoff is equal to 1 , so we can conclude 
using similar arguments that Eve wins for the objective UDirBnd. Thus in all these cases. Eve wins for the 
objective UDirBnd. 

Finally, assume that automaton J\f is universal and let us show then that Adam has a winning strategy 
for the complement of the Bnd objective. Indeed, if Eve plays a word w = #wi#rc 2 #'W 3 #... ■ ■ ■ 

that respects conditions Ci and C 2 , then we know that Niwi) < 0 for each z < 0. On such word, Adam can 
follow accepting runs in the gadget of Fig. [51 As the length between two consecutive # is at most b, we know 
that the mean-payoff of the run constructed by Adam is less than or equal to It follows that Adam 

wins the complement of the Bnd objective as claimed, as Bnd objective implies the mean-payoff objectives 
(as shown in Fig. [T] ). □ □ 


C Missing proofs from Section |4] 


Lemma 2. The number of elements in T is not greater than . 

Proof. 

1^1 < 


IQMma 

I 

_ 2 I ^ I '^max'iosCi^'^max) 




□ □ 

Lemma 3. Let p = oqito ... o„ be an abstract path, p G T such that supp{(p) C oq and supp~^{p,(p) = 
/oO'o ■ ■ ■ fn G {J- ■ E)*. A state q G Q is reachable from some state qo G supp{(f) through a concrete path 
qoao ...qnG 7(p) if and only if q G supp[fn). 
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Proof. (=4>) We proceed by induction. We will show that for all 0 < j < n, for all qj £ supp(/j) there is 
a concrete path go^o ■ ■ - qj such that qk G Ok for all 1 < fc < j and go G supp(vj). Note that for j = 0 
the claim trivially holds. Assume the claim holds for j. From the definition of cr-successor and supp“^ we 
have that supp(/j+i) = post^^ (supp(/j)) C Oj+i. This means that for all g^+i £ supp(/j+i) there must be 
some qj £ supp(/j) such that (g^,cr^,gj+i) £ A. Hence any qj+i is reachable from some qj via aj which, by 
inductive hypothesis, is in turn reachable from some go £ supp((/3) via a concrete path of the desired form. 

(«^=) We now show - once more by induction on j — that for all 0 < j < n, ii there is a concrete path 
goCTo ■ ■ .qj such that go £ supp((^) and qk £ Ok for all 1 < fc < j, then qj £ supp(/j). The claim holds for j = 0. 
Assume that it holds for some j. From the assumptions we have that (g^, Uj, gj+i) £ A and g^+i £ Ok+i- 
Further, we know that qj £ supp(/j) by inductive hypothesis. Hence, g^+i £ post,,, . (supp(/j)) C Oj+i which 
means that g^+i £ supp(/j+i). □ □ 

Lemma 4. Let p = oqcfo ■ ■ - On be an abstract path, p ^ T such that suppijp) C oq and supp~^{p,(p) = 
focro ■ ■ ■ fn & (J' ■ S)*. Given state p £ supp(/„) and 1 < I < Imax such that I < n, then there is a window of 
length I open at p if and only if fn{p)i < 0. 

Instead of directly providing a proof of Lemma |4l we prove a more general result below. Consider the 
three conditions stated in Lemma [TUI We shall prove that that Cl => C2 => C3 ^ Cl. Since Cl corresponds 
to having a window of length I open at p from supp((p), the desired result follows from transitivity. 

Lemma 10. Let p = ogao ... o„ be an abstract path, ip G iF such that supp{p) C og and supp^^{p,p) = 
fgag ■ ■ ■ fn S (F ■ E)*. Given state p £ supp(/„) and 1 < I < Imax such that I < n, let X = n — 1. The 
following three statements are equivalent. 

Gl. There is a concrete path qgcrg ... g„ £ 7(p) with qn = P o-nd qg £ supp{p) and 

m 

< 0 

j=n-l 


for all n — I < m < n. 

G2. fn{p)i < 0. 

G3. There is a concrete path qgUg .. .qn & l{p) with qn = P o-nd qg £ supp[p) such that 

(a) fj(qj)j-x < 0 for all X < j < n, and 

(b) fk{qk)j-\ +w{qk,ak,qk+i) = /fe+i(gfe+i)fc-A+i for all X < k < n. 

Proof. (C3 Cl) We will apply induction on TO. From the definition of cr-successor we have that/A+i(gA+i)i = 
min{0, w(gA, cta, gA-i-i)}- From assumption (a) we know that /A+i(gA-i-i)i < 0. Thus, the claim holds for 
TO = A. Assume it holds for to. To conclude the proof, we now show that the claim holds for to -|- 1 as well. 

m+l 

wiQj, CTj, qj + l) = fm (^m )m —A + w{qm,crm,qm+i) ind. hyp. 

j=n-l 

= fmiqm)m-X+l from (6) 

< 0 from (a). 

(Cl ^ C2) We show, by induction on to, that for all A < to < n 

m 

fm+l{qmi)m — X+l A ) O’i ) 9j+l ) • 
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The desired result follows. As the base case, consider m = X and note that by definition of cr-successor we 
have that 


/a+i((?a+i)i =min({0}U{w(p,crA,(?A+i) | p € supp{fx) A {p,ax,q\+i) £ A}) 

< w{qx,cr\,q\+i)- 

Thus the claim holds. Assume that the claim is true for m. From the definition of cr-successor we have that 

/m-t-2(^m-t-2)A —m-t-2 ^ /m-t-1 (^m-t-1 )m—A-t-1 T '^{qm+l t ^m+1 t qm+ 2 ') • 

From the inductive hypothesis we get have that the right hand side of the inequality is equivalent to 

m+1 

Thus the claim holds for m -f 1 as well. 

(C2 => C3) We inductively construct a concrete path go^ro • ■ ■ qnl{p) with qn = p and go S supp((p) such 
that 

(1) fn-k{qn-k)i-k < 0 for all 0 < /c < I, and 

(2) fn—k{jln—k)l — k — fc-t-1 , ^n—k+ 1 ^ qn—k) “t“ fn—k+l{qn—k+l)l — k+l for all 1 ^ < 1. 

As these conditions are equivalent to (a)-(6) from C3, the result follows. Note that for fc = 0 we have that 
(1) holds trivially since p G supp(/ji) and fn{p)i < 0 by hypothesis. If fn-k{qn-k)i-k < 0 then, by definition 
of cr-successor, it follows that there is some g' G supp(/„_fe_|_i) C On-k+i such that /„_fe_|_i(g');_fc+i < 0 and 
fn-k{qn-k)i-k = w(g„-fc+i,cr„_fe+i,g„_fe) -|- /„_fc+i(g„_fc+i)i_fc+i. In other words, g' is the source of the 
minimal cr„_fe_|_i-transition of a state from supp(/n,_fe_|_i) to qn-k- Let qn-k+i = q' ■ Continue in this fashion 
defining every qi up to g„_;. Now, from Lemma[31 we have that g„_; is reachable from some state in supp((/3) 
via a concrete path of the desired form. Any such path is a valid prefix for the sequence g„_/cr„_; ... g„ we 
constructed above. □ □ 

Lemma 8. Given upward-closed set S G 7^(-F) and f,gG J-\U, if f € UPre{S) and f d: 9, then g G UPre{S). 

Proof. We have that for all cr, there is ha- € S such that (/, cr, ha) G A'. By construction of A' we also know 
that there is ia such that {g,a,ia) G A', and furthermore, since supp(/) C supp(g), we get that 

supp(/i,^) = post^(supp(/)) n o 
C post^(supp(g)) no 
= supp(v) 

for some o G Obs. Note that: 

(1) since f,g^U, then f{p)u^^ = g{p)u^^ = 0 for all p G supp(/); and 

(2) ia{q)i = ha{q)i for all g G supp(/icr). 

From (1) and since f 9, there is a function a : {1,..., ^max} {!,...,/max ~ 1} such that for all 
1 < a: < /max we have that a{x) > x and f{p)x A 9{p)a{x) holds for all p G supp(/). Observe that for all 
g G supp(/io.) and any 2 < x < /maxi we have that 

ha{q)x = min ({0} U {f{p)x-i + w{p, a, q) \ f{p)x-i < 0} 

pesupp(/) 

> min ({0}U{g{p)a(x-i)+w{p,a,q) \ g{p)c,ix-i) < 0} 
pesupp(/) 

— ^(7 (g)a(3; —■ 

It follows that ha ^ ia and that, since S is upward-closed, ia G S. Thus, we have shown that for all ct, there 
is icr G S' such that (g, cr, ia) G A', which implies that g G UPre(S). □ □ 
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D Proof of Theorem [4] 

Lemma 11. Given upward-closed set S G ViJ-), lUPre\{[S\) = lUPre{S) \WJ. 

Proof. We first show that if / € [UPreJ([5'J) then / G UPre(S') \ U. We have that f and Vct G G 

G F : (f,a,r'^) G A' and q' < r'^. Since S is upward-closed and q' :< r'^, we know that r'^ G S. 
Hence, we get that Vcr G S,3r^ G S : {f,a,r'^) G A', which implies that / G UPre(S') \ L{. 

Next, we show that if / G [UPre(S') \ then f G {p' G J- \ U \ Vcr G E, 3g' G a, 3r' G : {p',cr,r') G 
A' and q' ^ r'}. We know that f and Vcr G E, 3r' G Q : cr, r') G A'. By definition of [S'], we know 

there is qr' G [S'] such that q^' ^ r' . Thus, we get that Vcr G E,3r^ G S,3qr' G [5] : {f,a,r') G A' and 
qr' A r'. 

Finally, we note that if / G [UPreJ([S'J) then not only is it true that / G UPre(S') \ U^ but furthermore 
/ G [UPre(S') \U\. Indeed, if this were not the case, then there would be 5 G [UPre(S') such that g ^ f 

and f ^ g. Then, by the argument explained in the previous paragraph, this would contradict minimality 
of / in [UPreJ([S'J). Similarly, if / G [UPre(S') \U\ then / G [UPreJ([S'J), as otherwise, by the argument 
from the first paragraph of the proof, minimality in the first set would be contradicted. Thus, the claim 
holds. □ □ 

Theorem 4. Given WGA G, Eve wins the DirFix{lmax) objective if and only if{qj} 2 tJ-X.{lG\\J[UPre\{X)). 

Proof. We note that for any upward-closed set S C P such that U C S we have, from Lemma H] that 
U U UPre(S') is again upward-closed and a superset oiU. In fact, it holds that 

WUUPre(5') = {[U\ U [UPre(S')J)t 

= {\U\ U [UPreJ ([S'] )t from Lemma [TTJ 

It is easy to show by induction that gX.{U U UPre(A)) = (pA.([Wj U [UPreJ([Xj)))t. Thus, {q\] 
gX.{\U\ U [UPreJ([S'J)) if and only if qj ^ qiX.{U U UPre(S')). From Proposition [T] and Lemmas [S] and [0] we 
know this is the case if and only if Eve has a winning strategy in the safety game in G' if and only if she 
wins the DirFix(Zmax) objective in G. □ □ 

E Proof of Proposition [2] 

Proof. (=J>) Assume J\f accepts ip- Let r = (^Oj*o,^o)(o’0!oi)('Zi;^i)(o'ii02)... be one of the accepting 
runs of the automaton on ip. By construction of M we have that go'^oQiO'i • • • G ')(ip). Let tt^ denote this 
concrete play and J = {jo,jiG2 ,... } be an infinite set of indices such that jk < jfc+i and {qj^AjkPjk) 
accepting for all fc > 0 . Such a sequence is guaranteed to exist since r is accepting. One can easily verify by 
induction on the definition of A" that for all fc > 0 it holds that tt^. ^ G'W{ij^. — Zmax, Imax)- It follows that 
Vm > 0 , 371 > TO : TTj. ^ GW(n, /max), which concludes our argument. 

(<^) Assume that ip = ooCToOiCi • • • ^ Fix(/max). Let tt = qoaoqiai G ^{ip) be the concrete play such 
that for infinitely many i it is the case that tt pP G'W{i, /max). We describe the infinite run oi Af on ip that 
accepts. Let J = {jo, ji, j2,... } be an infinite set of indices such that jk + /max < jk+i and tt ^ GW(jfc, /max) 
for k > 0 . The sequence is guaranteed to exist because of our choice of tt. Observe that this implies there 
is a run r = (go,*o,''^o) (o’o,oi) {qi,ii,ni) ((71,02)... of the automaton where for all /c > 0 we have that 
Uj^+i = w{qj^,(Jj^,qj^+i) and for all 1 < / < /max then 


'^jk+i ~ '^jk+i-i P '^iQjk+lPjk+l■>Qjk+l+^)■ 

FuTtheTlnoTe, in this run it holds that for all /c > 0 we have = /max- Hence, said run is such that 

for all fc > 0 the state *jy+;max, ^^ifc+imax) accepting. We conclude that the automaton accepts 

Ip. ““ ““ □ □ 
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F Proof of Proposition [3] 

Proof. (=J>) Assume Af accepts a. Let r = {qo, q'oio, no) (ao, oi, si) (gi,q{ii,ni) (ui, 02, S2) ■ • ■ be one of the 
accepting runs of the automaton on a. By construction of Af we have that gpcro^lcri • • • S 7(V’)- Let tt^ 
denote this concrete play, J = {joj Ji) ■ • ■ } cind K = {fco, /ci,... } be two infinite sets of indices such that 
ji < ji+i and ji + Imax £ ki, for all I > 0, and for which we know that 

• , q'kt : iki, nki ) is accepting for alU > 0, and 

• ^Ji+imax < 0 A = Imax- 

Such sequences are guaranteed to exist since r is accepting. Assuming the correctness of B, one can easily 
verify by induction on the definition of A'" that for all Z > 0 we have that nr, at ki merges with a path 
having a violation at ji. It follows that merges with infinitely many violating paths. From Lemma [HI we 
get that V' ^ UFix(/i„ax)- 

(<;=) Assume that = oo<JoOiai ■ • • ^ UFix(Zmax)- Let n = qoooqicri € 7(V') be the concrete play that 
merges with infinitely many violating paths (see Lemma 1^1 We describe the infinite run of W on a = S(V') 
that accepts. Let J = } and K = (kg, ki,... } be two infinite sets of indices such that ji < jf+i and 

Ji+^max+1 < ki, for all I > 0, and for which we know that there is some xi € 'y{^p[.■kl\) such that n[ki\ = 
and for all jk < m < j/c + Zmax +1 we have w(x[jk--ni]) < 0. The sequences are guaranteed to exist because of 
our choice of n. Observe that this implies there is a run r = {qo,qoio, no){cro,oi,si){qi,q[ii,ni){ai, 02 , S2) ■ • ■ 
of the automaton where for all Z > 0 we have that Ujj+i = w{qj,, aj^, qji+i) and for all 1 < 6 < Z^ax then 

"^ji+b — njl-^-b-l + 'w{qjl^b,n'j,.^-b,qjl+b+l)■ 

Furthermore, we have that = T for all Z^ax A b < ki and qki = q'k^ ■ Hence, said run is such that for all 
Z > 0 the state iqki,qkiAki,nk,) is accepting. We conclude that the automaton accepts "0. □ □ 
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